Enable Text2text task on ipex #1054

jiqing-feng · 2024-12-09T05:36:14Z

Add IPEXModelForSeq2SeqLM support, optimized by torch.compile. It forced attn_implementation="static" to be compatible with torch.compile.

HuggingFaceDocBuilderDev · 2024-12-09T05:41:33Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jiqing-feng · 2024-12-09T09:24:26Z

Hi @echarlaix @IlyasMoutawwakil . Please review this PR, thanks! BTW, the failed tests are not related to my changes.

IlyasMoutawwakil · 2024-12-09T10:17:54Z

docs/source/ipex/inference.mdx

@@ -14,7 +14,7 @@ Optimum Intel can be used to load models from the [Hub](https://huggingface.co/m

 ## Loading

-You can load your model and apply IPEX optimizations (apply torch.compile for non-generation tasks). For supported architectures like LLaMA, BERT and ViT, further optimizations will be applied by patching the model to use custom operators.
+You can load your model and apply IPEX optimizations (apply torch.compile except text-generation tasks). For supported architectures like LLaMA, BERT and ViT, further optimizations will be applied by patching the model to use custom operators.


I don't understand what you mean here, I see torch.compile being applied to text-generation task, why does it say "except text-generation tasks" here ?

Actually, we didn't apply torch.compile in text-generation task which means IPEXModelForCausalLM doesn't have torch.compile in init. And generation tasks are also excluded in IPEXModel.init when calling torch.compile

Signed-off-by: jiqing-feng <[email protected]>

optimum/intel/ipex/modeling_base.py

jiqing-feng · 2024-12-11T04:50:16Z

Hi @IlyasMoutawwakil . I have fixed your comments, please take the 2nd round review. Thanks!

Signed-off-by: jiqing-feng <[email protected]>

jiqing-feng · 2024-12-17T11:59:47Z

Hi @IlyasMoutawwakil . It should be ready to merge, thanks!

jiqing-feng marked this pull request as ready for review December 9, 2024 09:23

IlyasMoutawwakil reviewed Dec 9, 2024

View reviewed changes

jiqing-feng added 6 commits December 9, 2024 12:31

enable IPEXModelForSeq2SeqLM

3888824

Signed-off-by: jiqing-feng <[email protected]>

set static cache

f9fa807

Signed-off-by: jiqing-feng <[email protected]>

add tests for IPEXModelForSeq2SeqLM

202df43

Signed-off-by: jiqing-feng <[email protected]>

add docs

4488073

Signed-off-by: jiqing-feng <[email protected]>

fix readme

16fecf8

Signed-off-by: jiqing-feng <[email protected]>

Merge branch 'main' into text2text

de501f4

jiqing-feng commented Dec 10, 2024

View reviewed changes

optimum/intel/ipex/modeling_base.py Outdated Show resolved Hide resolved

IlyasMoutawwakil requested a review from echarlaix December 10, 2024 10:32

jiqing-feng added 3 commits December 11, 2024 12:06

refactor compile

4225bf0

Signed-off-by: jiqing-feng <[email protected]>

fix check

2ac7ecf

Signed-off-by: jiqing-feng <[email protected]>

fix ruff check

24b988c

Signed-off-by: jiqing-feng <[email protected]>

yao-matrix approved these changes Dec 13, 2024

View reviewed changes

Merge branch 'huggingface:main' into text2text

5c4f9a1

jiqing-feng mentioned this pull request Dec 16, 2024

Enable quant model support #1074

Open

jiqing-feng added 2 commits December 16, 2024 14:21

fix check

22458d2

Signed-off-by: jiqing-feng <[email protected]>

fix tests

c11e01c

Signed-off-by: jiqing-feng <[email protected]>

IlyasMoutawwakil approved these changes Dec 17, 2024

View reviewed changes

fix opt tests

8ff5fb8

Signed-off-by: jiqing-feng <[email protected]>

IlyasMoutawwakil removed the request for review from echarlaix December 17, 2024 12:08

IlyasMoutawwakil merged commit a76be08 into huggingface:main Dec 17, 2024
22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable Text2text task on ipex #1054

Enable Text2text task on ipex #1054

jiqing-feng commented Dec 9, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 9, 2024

jiqing-feng commented Dec 9, 2024

IlyasMoutawwakil Dec 9, 2024

jiqing-feng Dec 10, 2024 •

edited

Loading

jiqing-feng commented Dec 11, 2024

jiqing-feng commented Dec 17, 2024

Enable Text2text task on ipex #1054

Enable Text2text task on ipex #1054

Conversation

jiqing-feng commented Dec 9, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Dec 9, 2024

jiqing-feng commented Dec 9, 2024

IlyasMoutawwakil Dec 9, 2024

Choose a reason for hiding this comment

jiqing-feng Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

jiqing-feng commented Dec 11, 2024

jiqing-feng commented Dec 17, 2024

jiqing-feng commented Dec 9, 2024 •

edited

Loading

jiqing-feng Dec 10, 2024 •

edited

Loading